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Abstract —Detecting the presence of target subspace signals 
with unknown clutters is a well-known hard problem encountered 
in various signal processing applications. Traditional methods 
fails to solve this problem because prior knowledge of clutter 
subspace is required, which can not be obtained when target and 
clutter are intimately mixed. In this paper, we propose a novel 
subspace detector that can detect target signal buried in clutter 
without knowledge of clutter subspace. This detector makes use 
of the geometrical relation between target and clutter subspaces 
and is derived based upon the calculation of volume of high 
dimensional geometrical objects. Moreover, the proposed detector 
can accomplish the detection simultaneously with the learning 
processes of clutter, a property called "detecting while learning". 
The performance of detector was showed by theoretical analysis 
and numerical simulation. 

Index Terms —subspace detector, unknown clutter, volume 

I. Introduction 

We consider the following subspace signal detection prob¬ 
lem widely existing in communication, radar, sonar and other 
fields of signal processing: 

Problem 1: Let ff be a Hilbert space, Hs C i? be a 
KNOWN signal subspace and He d H he an UNKNOWN 
clutter subspace. Given the sampled data y G H, could we 
determine whether y lies in Hs (B He or entirely in He 
with the influence of random noise? In other words, whether 
y satisfies 


y = s + c + w, 

(1) 

y = c + w. 

(2) 


where s G Hs, c G He, s,c ^ 0 and w is random noise. 

Detecting target signal in certain signal subspace with 
known clutter has been considered by several researchers and 
various schemes has been proposed. Among these works, the 
clutter subspace is commonly modeled to have low rank EH, 
and the most remarkable approach is the Matched Subspace 
Detector (MSD) by Scharf et. al. El, which is actually a 
generalized energy detector using the prior knowledge of target 
and clutter subspaces. Although tremendous variations and 
applications of MSD has appeared 113 1415161 . the key precon¬ 
dition for the success of matched subspace detector is that the 
clutter subspace He must be KNOWN beforehand. It usually 
can not be satisfied in practice, e.g., in radar, reconnaissance, 
mobile communication, etc. As far as we know, because the 
projection-based detectors derived from GLRT can not be 
constructed explicitly, there hasn’t been any detector that can 
work when the clutter subspace is totally unknown. 


However, the structure of clutter subspace could be explored 
by successively sampling from it. As a matter of fact, a generic 
property of randomly sampling in linear space is that the 
sampled vectors are generally linearly independent, so ideally 
the clutter subspace could be "reconstructed" by obtaining 
multiple samples from the clutter subspace, which naturally 
form the basis of the clutter subspace. But it should be 
noted that generally the information of target signal is mixed 
intimately with the clutter in the samples and it is impossible 
to separated the "pure" clutter from target signal so that the 
basis for clutter subspace could be extracted alone. It is the 
core difficulty for detecting the target signal against unknown 
structured clutters. 

In this paper, a novel method to detect target signal em¬ 
bedded in an unknown structured low-rank clutter was given. 
The main idea of our detector is utilizing the geometrical 
characteristic of the sampled data. Here the volume, which 
is a common concept for geometrical objects, is defined to 
measure relationships between subspaces (more concretely, it 
is the parallelotope with its edges being the basis vectors 
of subspace). An intuition of using volume for detection 
is that, since the "volume" of low-dimensional subspaces 
in high-dimensional linear space is zero, the judgment of 
whether or not a target vector lies in certain subspace could 
be transformed naturally to the calculation of "volume" of 
parallelotope built by the target vector and the basis vectors 
of that subspace, in space with pertinent dimension. If the 
’volume’ is zero, then the conclusion can be drawn that 
the subspace contains the target vector, otherwise the target 
vector must lie outside the subspace. Indeed, Although there 
have been researches using similar geometrical approaches to 
perform subspace signal detection ansi . the advantage that 
the proposed volume-based detector requires no knowledge of 
clutter is first discovered and analyzed by us. 

The remainder of this paper is organized as follows: Some 
preliminary backgrounds on the geometrical concepts for 
linear subspaces such as principal angles and volumes were 
summarized in section II. Then the volume-based subspace 
detector was introduced and its property was discussed in 
detail in section III and IV respectively. 

H. Preliminary Background 

In this section, some important concepts of linear space 
geometry were reviewed concisely. Only necessary material 
for our discussion was put forward for the space limitation. 
For details, please see m and reference therein. 




A. Principal Angles between Subspaces 

The concept of principal angles d is the natural gen¬ 
eralization of that of angles between two vectors. Principal 
angles can be used to formulate the relationship between two 
subspaces. 

Definition 1: For two linear subspaces Hi and H 2 , with 
dimensions dim(ffi) = di,dim(ff 2 ) = d 2 - Take m — 
min(di,^ 2 )^ then the principal angles 0 < < • • • < 0^, < 
tt/2 between Hi and H 2 are defined by 

cos 9i= max ufvi, subject to 

Ui^Mi ,Vi^H2 

Wuih = l|tt »||2 = 1, ufuj = 0,v'[vj = 0, 

where j = 1, - ■ ■ ,i — l,i = 1, - ■ ■ m. 

As important concepts of linear space geometry, the prin¬ 
cipal angles are widely applied in scientific and engineering 
fields. The geodesic distance which is the key metric measure 
on Grassmann manifold, as well as numerous kinds of distance 
measures, is defined using the principal angles imi, such 
as the Chordal Distance, Binet-Cauchy Distance and Pro¬ 
crustes Distance commonly seen in signal process applications 
lfT2d . Moreover, the volume of subspace used in this paper 
to construct our subspace detector is also closely related to the 
principal angles. 

B. The Volume of a matrix 

Suppose a full-rank matrix X e d < n, then its 

d-dimensional volume is defined as 


d 

vold(X) (3) 

where ai > a 2 > ■ ■ ■ > crd > 0 are the non-zero singular 
values of X. For X is of full column rank, its d-dimensional 
volume can be written equivalently as ifTOlil 

vold(X) = ^det{XTX). (4) 

The following simple lemma is widely useful in application 
of volume for subspaces. It means that the /c-dimensional 
volume of the basis of subspace with dimension less than k 
is definitely zero. 

Lemma 1: Suppose = [xi,--- ,Xn] be a matrix 

whose columns are a group of vectors in a Hilbert space H 
and dim(span = i, then 

volfe(X(")) = 0, i<k (5) 


d-dimensional volume provides a kind of measure of sepa¬ 
ration between two linear subspaces. Normalized by individual 
volumes of both matrices, the volume of a matrix composed 
of two matrices gives a new kind of correlation, called the 
volume correlation. For n-dimensional Hilbert space H and 
its two subspaces Hi and H 2 with dimensions dim(iTi) = 
di, dim(f/' 2 ) = d 2 , we have 


coiTyoi{Hi, H2) 


VOldi+d2([-^li ^ 2 ]) 
voU(Xi)vold,(X2)’ 


( 6 ) 


where Xi and X 2 are basis matrices of Hi and H 2 , and 
[Xi,X 2 ] means putting columns of matrices Xi and X 2 
together. It is closely related to the principal angles between 
subspaces Qol, 

min((ii ,(^2) 

corrvoi(fFi,fF2) = sin0j(iTi, iT2), (7) 

where 0 < 9j{Hi, H 2 ) < 27r, 1 < j < min(di,d 2 ) are the 
principal angles of subspaces Hi and H 2 . 

It can be seen intuitively from (Q that the volume correla¬ 
tion coiTyofHi, H 2 ) can actually play the role of distance 
measure between subspaces Hi and H 2 . When Hi and 
H 2 have vectors in common, i.e., dim(ffi p| H 2 ) > 1, 
we have coTiyofHi, H 2 ) = 0. On the other side, when 
Hi is orthogonal to H 2 , we have vo\di+d 2 ([^l,7f2]) = 
voldj(Xi) vold 2 (X 2 ), in other words, coiTyoi{Hi, H 2 ) = 1. 
Although volume correlation may not rigorously be a metric 
measure, we still regard it as a generalized distance measure 
that plays a key role in our proposed subspace detector. 

HI. The Correlation Subspace Detector in a 

NOISELESS ENVIRONMENT 

From this section, we are going to introduce the volume 
correlation subspace detector (or VC subspace detector briefly) 
step by step. In order to fully convey the geometrical intuition 
about our subspace detector, in this section, we temporarily 
assume the noise component is not present, i,e., u; = 0 in 
Problem 1. For clearance and easy of understanding, we just 
give some main idea about the geometrical explanation of the 
proposed detector in noiseless environment, and leave rigorous 
analysis of the noisy situation for the next section. 

A. Main Idea 

Unknown clutters with subspace structures were the primary 
obstacle for efficient detection of target signal. To reach the 
purpose, the designer of detectors must And the way to clarify 
the intrinsic construction of clutter subspace. Just as most of 
the traditional approaches for background learning, multiple 
samples were adopted to explore the clutter subspace. The 
following observation is the foundation for the exploration of 
clutter subspace. 

• Suppose ff be a n-dimensional Hilbert space, 
Xi, - ■ ■ ,X}., k < n be randomly sampled vectors 
from H, then in the generic situation, we have 

dim(span{aii, • • • , X/^}) = k, (8) 

In words, random samples Xi, - ■ ■ ,xj. are generally 
linearly independent. 

• In the case of k > n, then in the generic situation, we 
have 

dim(span{a;i, • • • ,Xk}) = n, (9) 

In words, Xi, - ■ ■ ,Xk are linearly dependent. 

Let He be the unknown clutter subspace with unknown di¬ 
mension di, Hs be the known target subspace with dimension 










^ 2 , 2 / 1 , • • • , Udi be samples representing our sample subspace. 
Without loss of generality, we assume dim{Hs H He) = 0 
throughout this paper. The critical point when we explore 
the clutter He is that, the sample subspace may contain 
both clutter and target signals in general. In other words, we 
cannot guarantee that the sample subspace is a "pure" clutter 
subspace. What we commonly get are samples like; 

r/i = Si + Ci, Si e Hs, Ci e He, * = 1,2, • • • , (10) 

It is impossible to separate the clutter and target signal apart 
and build the clutter subspace from these How does the 
information of sample subspace be mined effectively? 

The volume correlation between subspaces is helpful for us 
to eliminate the impact of mixing of clutter and target signal. 
In fact. Let Si, • • • , Sd^ be the known basis vectors of Hs- 
It has been mentioned that in the generic scenario, different 
j/i sampled from Hs © He are linearly independent. In other 
words, innovative directions of basis vectors in Hs (B He 
are revealed continually along with the sampling process as 
follows, 

dim(span{yi, Si, • • • , = 1 + ^ 2 , 

dim(span{yi, 2 / 2 , Si, • • • , Sds}) = 2 + ^ 2 , 


dim(span{2/i, • • • ,2/di,Si,--- ,5^3}) = di +^2, (H) 

The question is what will happen next. The above process 
could be analyzed in another way from the viewpoint of 
volume mentioned earlier. In particular, the core idea of our 
subspace detector in noiseless environment could be illustrated 
fully using volume. Firstly, when both the signal and clutter 
are present, we have 

voli+d2([yi,si,--- ,.SdJ) > 0, 

vol 2 +d 2 ([ 2 /i, 2 / 2 ,Si,-'- ,5^2]) > 0 , 


vold^+d^iiyi,--- ,ydi,sie-- > 0, (12) 

Next, the magic will happen for the next dimension, i.e., when 
there are di + d 2 + 1 sample vectors, there will be 

+ l ([ 2 / 1 ,*'' ,2/fil,2/ttl + l,^l,‘'',^fi 2 ]) b, (13) 
while on the other hand, 

voldi+i(span{yi,--- ,yd^,ydi+i}) >0. (14) 

The reason is because, di + c ?2 + 1 sample vectors in this 
scenario have not spanned the entire subspace Hs © He 
according to the previous statement of randomly sampling; 
but yi,--- ,ydi,ydi+i,si,- ■ ■ ,Sd^ can span Hs © He, in 
another word, 

dim(span{yi,--- , , ydi+i, Si, • • • ,• 8 ^ 2 }) 

= dim(Xs © Xc) = di + 0 ( 2 , (15) 

On the other hand, if the sample subspace only contains pure 
clutter, we obtain 

voldi+d2+i([2/i,--- ,2/di,ydi+i,Si,'-- , 5 ^ 2 ]) =0, (16) 


and 

voldi+i(span{yi,-• • , y^^, ydi+i}) = 0. (17) 

(O, (O, (Hsii and (O indicates that, di + 1 is the criti¬ 
cal number of samples for detection of target signal in the 
background of clutter with unknown subspace structure, i.e., 
the "breakpoint". The knack of detection in this noiseless 
situation is, sampling continually, computing the volume of 
parallelotope spanned by all the sample vectors and known 
basis of target subspace at various dimensions and inspect 
the change of results. Once the volume vanishes, it means 
the number of samples reaches the critical point. Then the 
process of sampling should be stopped and the volume of 
sample vector themselves is calculated. The decision can be 
made based on whether the result is zero, i.e., whether Gil 
or Gl- 

IV. The Volume-Correlation Subspace Detector in 

NOISY environment 

A. Main Idea 

The main problem here is the sample subspace has been 
contaminated by random noise and can not be used directly 
to compute the volume correlation in VC subspace detec¬ 
tor. Therefore the noise must be cleared in advance. For 
most statistical signal processing algorithms concerned with 
subspaces, such as MUSIC, ESPRIT and so on, the target 
signal and random noise are separated into signal subspaces 
and noise subspaces by eigen-decomposition of correlation 
matrices firstly for further treatment. It implies the natural 
strategy of extracting signal subspaces for follow-up analysis 
and discarding noise subspaces simply for noise elimination. 

To be specific, we reconsider Problem 1 where w is 
assumed to be white Gaussian noise with zero mean and 
variance . Traditional subspace methods mentioned above 
deal with the correlation matrix Ry of the sample data y, 
which is denoted by 

Rj, = E{yy^} (18) 

According to our problem setting, the eigenvalues of R^ could 
be listed as 

Ai > A 2 > • • • > Afe > Afe+i = • • • = A„ = (T^, (19) 

and the corresponding eigenvectors are 

9ij 92, ■ ■ ■ , 9fe, 9fc-i-ij ■' ■ , Qn 

Denote Qse ■= [9i, 92 ,• • • ,gfc] G Qn ■= 

[qfe+i, qfc+ 2 , • • • , 9n] G It is clear that 

s]id,n{Qse) = Hs ® He, k = di+d 2 , (20) 
when both signal and clutter are present, and 

span(Qsc) =-ffc, k = di, (21) 

when the sampled data contains "pure" clutter. The 
span(QsC') and span(QAr) are commonly called signal sub¬ 
space and noise subspace. Asymptotically span(Qsc') could 
be used as proxy of Hs © He (or He) and the main idea in 
previous section is workable as well in the noisy environment. 




B. The proposed VC subspace Detector 

The VC subspace detector is extended to noisy scenario as 
follows; 

• Initial Step : Denote the received data {j/i,--- ,?/«} 

by Obtain {si, • • • , the orthonormal basis 

vectors of known target subspace and denote it by Qs- 
Let the sample covariance matrix be = q. Index i is 
set to 1. Set two thresholds T and e at appropriate values. 

• Step 1 : Get the new sample yi, compute the covariance 
matrix as 

R(*) = (22) 

I I 

Assume the eigenvalues of be 

Ai > A 2 > • • • > Xki > = • • • = A„, (23) 

and the corresponding eigenvectors be 

qiA2,--- AkiAk,+!,■■■ An, (24) 


approximations of clutter subspace. At the decision point, 
background learning is ended spontaneously and the decision 
on the existence of target will be made naturally. There is no 
need for extra effort of background learning. The learning and 
detection is merged perfectly in VC subspace detector. We call 
this interesting property "Detection while Learning". Our VC 
subspace detector could be listed as blind detecting methods. 

Remark 2. It should be noted that the subspace is ac¬ 
tually an estimation of the real signal subspace. The accuracy 
of this approximation had been studied extensively ifBITel 
and the feasibility of had been proved asymptotically. 
Hence we can expect the proposed VC subspace detector 
will asymptotically approximate the VC subspace detector in 
noiseless scenario, and this expectation is validated by the 
theory in next section. 

Remark 3. The dimension of signal-plus-clutter subspace (or 
clutter subspace), i.e., ki in (l23t actually needs to be estimated. 
Since there are various methods can be used, like AIC or MDL 
ifiTlTSl , we will not discuss this topic in detail here. 


obtain the estimated basis of the sampled subspace by 

,qfcj, (25) 

• Step 2 : Compute the test quantity as 

r(H«)=volfe,+d,([g«, Qs]), ( 26 ) 

. step 3 : if 1/T(i2(®)) > T, conclude the existence of 
target signal and exit; 

else if |l/T(i?(®)) — l/r(i2*^®“^))| < e, conclude the 
non-existence of target signal and exit; 
otherwise, set i = i -f 1 and go to step 1; 

Remark 1. It should be emphasized that the most remarkable 
advantage of VC subspace detector is its feature of "Detecting 
while Learning". To be specific, the detection could be com¬ 
pleted without separated sessions for background learning with 
VC subspace detector. As well known, background learning is 
very popular in adaptive processing for radar, communication 
and other signal processing problems. Channel equalization 
in communication transmission, CFAR (Constant False Alarm 
Rate) operation in radar detection and estimation of covariance 
matrices for clutter echoes in STAP (Space-Time Adaptive 
Processing) all belong to sessions for background learning. 
There are double common defects for all these schemes. The 
first is that the efficacy for estimation of clutter background 
might be influenced heavily by existence of target signal, so 
called as target leakage in literatures; the second is the non- 
homogeneousness widely existed in clutter environment which 
easily leads to mismatch of learning consequence with the 
actual clutter scenario at the target location. Nevertheless, VC 
subspace detector stands far away from these trouble because 
the process of background process is accomplished implicitly 
and simultaneously with the detection operation. Along with 
the raw data being sampled and put into work sequentially, the 
volume correlations are examined and tested constantly until 
the threshold is reached. The information of clutter subspace 
is being learned in the form of volumes of low-dimensional 


C. Theoretical Property of the VC Subspace Detector 

To avoid the vagueness brought by asymptotical conclusion 
of the performance of VC subspace detector in the noisy 
background, we give some non-asymptotical analysis on the 
capability of our detector with knowledge of random matrices 
and concentration inequalities. We have the following result. 

Theorem 1: Let H be n-dimensional Hilbert space, Hs 
and He be target and clutter subspaces of H respectively, 
Hs n He = {0}, dim He = di, dim iTs = d 2 , 

yi = Xi + Wi, i = 


where yi is the sampled data, Xi G Hs © He (or He), Wi ~ 
J\f{Q,aAn) is Gaussian white noise. Denote the eigenvalues 
of the covariance matrix of received signal Ry by (fT^ . 

If the target signal presents in sample data, then for any 
0 < e < 1 and d > 0, if 
1 + £ 


m > 


(v^dTT-i)2 


XiXj 


d2-\-di 


+ {n — di — d 2 ) 




^ (cr^ - Ai)2 


6^2 + ^! 

£ *^^® “ 

\ i/j 

then there exists a constant C > 0, such that 
|r(H(®®®))2| < (5®*= + 0(5®'^+!). 
holds with probability 

, (di + do) • n • £^ - 

P > 1 - exp{-^i-^-}. 

On the contrary, in the case of non-target, for any 0 < e < 1 
and (5 > 0, when 

1 + £ 


(27) 


(28) 


(29) 


m > 


iVsTT-1)^ 


d-]^ d-]^ 

(EE 


Xi Xj 


i=l 0=1 (^® ) i=l 


E(’^- di) 


{A - Xi) 


r) , (30) 


















we have 


Plot of 1/T(i?*”**) with respect to m (di = 40,d 2 = 10) 


|r(HW)2 _ < sd,-i{Q'^Pi-Qc)S + 0{S^), 

(31) 

holds with probability 

P > 1 - exp{- ^^ c ^ 

here t{Hs,Hc) = voldi+d^ ([Qs, Qc]) > 0 is a constant 
related with Hs and He, Qs, Qc are the orthogonal bases 
of Hs and He, respectively, and Pg is the projection matrix 
onto Hg, Sk{A) for matrix A G is defined as: 

Sk{A) := ^ CTii • • • CT^, 1 < fc < n. (33) 

l<zi <n 

Theorem [T] describes the performance of our VC subspace 
detector in noisy environment. The main result (l28T l. together 
with dZTl l and ( |29] |. implies that when the target signal is 
present, the test quantity 1/T(i2(™() of VC subspace detector 
will tend to infinity with an overwhelming probability, when 
the number of sample data is sufficient large. On the other 
hand, (l3Tl l together with (l30l l and (l32] | ensures 
to tend to a finite value. Therefore the decision point of 
this detector is to test whether increases over a 

threshold, or stops increasing at a finite value. The result of 
Theorem[T]implies that the output of our VC subspace detector 
will remain almost unchanged no matter what clutter is given, 
whether or not there is noise, so far as that we have enough 
sample data. This shows the asymptotic effectiveness of our 
VC subspace detector. 

The effectiveness of detector 1 was demonstrated by numer¬ 
ical simulation in figure [T] Here n = 1024, di = 40, d 2 = 10, 
SNR = — lOdB and the target and clutter signal were chosen 
randomly from corresponding subspaces. The average values 
of 100 monte-carlo simulations of the volume correlation 
1 /T(i?("*^) with respect to different m are plotted, and the 
values of the detector output with respect to each simulation 
are showed by a scatter diagram in the small sub-figures. It 
can be seen from the figures that as m increases, the test 
quantity 1/T(i?(’"() converges to infinity when there is target 
signal, while there is no target signal, 1/T(i2(™)) converges 
to a finite value. Therefore, as a whole, the simulation result 
verified the validity of VC subspace detector. 

V. CONCLUSION 
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Fig. 1: The reciprocal of volume correlation 1/T(i2(™() 


Appendix A 
Proof of Theorem[T| 

In order to prove Theorem [T] two theorems as intermediate results 
are needed. 

Theorem 2: Let H he a n-dimensional Hilbert space, Hs and 
He be the subspaces of H corresponding to target and clutter 
respectively. Hs H He = {0}, dim iTc = di, dim iTs = d^. 
Suppose j/i, i = 1, • ■ • , be sampled data either containing both target 
and clutter, 

yi=x\^'^+xf\ i = (34) 

or only containing clutter 

yi^xf\ i = (35) 

where G Hs and G He- Let 

V'’") (36) 

and and Qs be orthogonal matrix with columns being the 

basis vectors of span(V^"*^) and Hg, then we have the following 
monotone property 

l/y(y(i)) < i/r(y(2)) < ... < (37) 


In this paper, we propose a novel subspace detector that can 
detect target signal buried in low-rank clutters and random 
noise without knowledge of clutter subspace. The proposed 
detector utilize the geometrical relation of clutter and target 
subspaces. The target signal could be detected without prior 
learning of clutter structure with our detector. The influence 
of clutter will be eliminated simultaneously with the detec¬ 
tion being performed. Theoretical and numerical analysis has 
validated the effectiveness of our new detector. 
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holds for both scenarios, where = vol^+^j Qs]). 

Theorem 3: Under the same assumption of theorem 111 the suffi¬ 
cient and necessary condition for existence of target signal in sampled 
data is that: there exits an integer k, such that 

= voWd,([Q^\Qs]) = 0, (38) 

for all m > k, and fe = di. Or equivalently, the sufficient and 
necessary condition for non-existence of target signal in sampled data 
is that: 

TiY^m-)) ^ voU+d,([Q^\ Qs]) > 0, (39) 

for all m > 1. 






















A. Proof of Theorem | 2 ] 

We will prove the monotone property l l37t . An useful Lemma is 
proved at first. It is just a simple property with very clear geometric 
intuition for volume of subspaces. 

Lemma 2: Let = [vi, ■'' .Vk] & and X £ R"'"' be 

two matrices, then 

(40) 

where is the orthogonal complement of projection matrix on 
column space of matrix A. 

Proof: According to the definition of volume for subspace 0, 
we have 

= det([X,y('“)]'^[X, 

-detl ) 


/ 

(fe-l) 

; \ 

(y(fc-l))T^ 

7 

1 

1 (y('=-i))^y, 1 

V yfx 


1 VkVk ) 


(41) 


B. Proof of Theorem | 5 ] 

Let Hs and He be target subspace and clutter subspace respec¬ 
tively. dim(LZ's) = d 2 , dim(LZ'c) = di- Assume there exists target 
signal in received data {yi,i = 1, • ■ • , m}, that is to say, 


yi = Hsoci + Hefii, 

(53) 

for some i £ S <Z {1, 2, ■ • ■ , m}, and 


yi=HcP^^\ 

(54) 


for other i £ {1, 2, ■ ■ ■ , m} \ S. Then under the generic hypothesis, 
we have 

rank([yi, • • • , y^]) = m, (55) 


for m < d\. Hence the result of successive orthogonalization could 
be written as 






(56) 


It should be stressed that in the case of fc = di -f 1, we still have 


rank([yi,-■ • ,ydi-i-i]) = di + 1, (57) 

because of the presence of target signal. In other words, 

- (58) 


Using Schur complement formula, 
det 


^ ^ 1 =det(A)det(Z?-BA""C), 


we obtain (in the next page) 

Because I — is idempotent matrix. 


(42) 


(44) 


(I - P[X,Y(k-l)]) = I - P[X,Y(k-l)], 

we obtain 

voil+ii[x, y «]) = Yoil_^+ii[x, (/ - 

= vo\l_,+,i[X,Y^^-^^])\\P^^^ v(^-Liyfcf 

(45) 

This is just what we want to prove. ■ 

With the notations in theorem|2l when m < dim(i/s (BHc), we 
have 

r(y(-)) ^ vold,+^([Qs, (46) 

where is matrix with columns being 

orthogonal basis vectors for yi"*L Moreover, 


Qy(m) — [Qytm-l) , 


(m). 


(47) 


we have 


For y all of its di -f 1 linearly independent directions includes 

two parts, one with di directions come from clutter subspace He 
and the other one direction is contributed by target subspace Hs- 
Let Qs be the matrix with columns being the orthonormal basis 
vectors of Hs, l l58t means that 

rank([Qs, = da -f di, (59) 

and the number of nonzero columns of [Qs, is di -l-da -I-1. 

Using Lemma [T] we obtain 

VOldi+d2 + l([Qs,Qy"^^^]) = 0, (60) 

Take k = di + 1, the necessary part of theorem has been proved. 

On the contrary, under the generic hypothesis, if there exists k 
such that 

volfe+d 2 ([Qs, Qy^]) = 0, (61) 

then there must be target signal in sample data t/i, • ■ • ,yk- 

Assume this was not the case, then each sample yi contained no 
target signal, we have 

V^'“> = [yi,--- ,yfe]cffc, (62) 

therefore the orthonormal basis matrix Q^^ of y"* satisfied 


(m) 

q\r = 


{I - 


(I-Q^-'>(Q 


Y 


DVr, 


= 

= Py(- 


-1) t/r. 


-1) t/r. 


Using Lemma 12 Let X — Qs, Y — Q^\ we obtain 
T{Y^^'>) = vold,+^_i([Qs, Q^“'>] 


(48) 


[Qs ’Q 


(m-l),qY 


(m). 


= T(y 


(m-l),. 


„("i) I 

■^(■m-l),qY II 


lQs-Q\ 

Take into account the property of projection matrices. 


(49) 

(50) 


Q\ 


= [qy^9'y^ 


,q^Y’] 


for m < ki and 


qM _ r^(i) 


= [9f^^qy^ 




for m > ki- According to 0 and 0, we obtain 


voW,,([Qs,Q^^]) 


voWfc,([Qs,Q^^]) 
volm(Qs) volfc2(Q^') 
corrvoi(Qs,Q^') 

min(«ii ,d2 ) 

n sin6l,(Xi,X2) >0. 
7=1 


(63) 

(64) 


(65) 


IIP (n 
" [Qs,Qf 


0^11 
iVv II 




= 1 , 


we have 




(m-1),. 


Thus I37t and theorem |2 has been proven. 


(51) Considering the monotone relation l l37b . we have 

vol^+fc2([Qs,Q^^]) >0, Vm e N, (66) 

(52) 

Contradiction! We have verified the sufficient part and the whole 
theorem has been proved. 





vol|+i([X,'K^''^]) = det 








det yi yt - yt [X,Y' 


(fe-i)l 




^Tyik-l) 


^Y^k-i)^TY(k-i) 


\X.Y 


(fc-l)lT 


voii.^+i{[X,Y^*^-^^])yUl - PixY^k-n])v^, 


Vk 


(43) 


C. Proof of Theorem Q] 

To complete proof of Theorem [T] several lemmas are required as 
necessary tools. These lemmas concerned with asymptotical distri- 
hution of eigenvectors of sample covariance matrix, concentration 
bounds and matrix perturbation. 

Lemma 3: (m Consider the matrix with columns 

being the r eigenvectors of sample covariance matrix 
corresponding to the largest r eigenvalues, 

= [<7i,<72,--- ,<7rl, 

its asymptotic distribution (for large m) is jointly Gaussian with mean 
Q = [qi, < 72 , ■ ■ ■ , <7r], 
and covariance • • ■ , where 


\ ^ \ ^2 
^(m) Ai r Aj X , ^ t'I 


j=i 


E — [ei, • ■ ■ , e^l £ 


II^^IIf < (l + £)^Tr(E0, 

i=l 

holds with probability 

P>l_exp{-^-^}. 


Si = ViAiV/ , 


then 


fii = Vf Bi, 

Ci ~ A/'(0, Ai), llfiills = lleilli 


Denote the elements of vector et by 

Bi = [ei,i, • ■ • , • • • ei,ri] , 

then different iij are independent and satisfy 

eij ~X{0,alf, l<j<n. 


(75) 


(76) 


Now we stack all these vectors ei, 1 < i < d into a single vector, 
i.e., we let 

— f'~T' '-T '-TiT — TrnrXTi rn'i\ 

e [gi , e 2 , • ■ ■ ? ^n] ^ ^ ? (77) 


then we have e ~ A/'(0, A), 

A = diag(Ai, ■ • ■ , A^) = diag(crf 4 , ■ • ■ • • • ,cr-i, • • • ,cr-„j 

Therefore, (ED is equivalent to 


II li ,||2 II - ||2 II -||2 

II-E-IIf — 2_^ \\e-i\\2 — l|e||2- 


(78) 


i = 1, • ■ • , r, and 

- qi)(q^"‘'- qfe)^ = S?"*'• 5i,fc, (67) 

where Ai > A 2 > • • • Ar > Ar+i = • • ■ = An = cr^ are 
eigenvalues of the covariance matrix Rn in dlSt . with qi,- - ,qn 
the corresponding eigenvectors. 

Lemma 4: For the random matrix 


( 68 ) 


where ei ~ A/'(0, Si), 1 < i < d, and E(eie^) = Si • di^fc, then for 
any 0 < £ < 1, there exists a constant C > 0 that depends on Si, 
such that 


It is well known that the norm of a Gaussian random vector 
will concentrate around its expectation. It has been proved that the 
norm of an i.i.d. Gaussian random vector will concentrate around 
its expectatiorQ. The problem of the concentration of ||e||| here is 
only slightly different with the one in oa. In particular, the elements 
of e have different variances in our case and the i.i.d hypothesis is 
violated to a small extent. Therefore, the proof will be adapted from 
the proof of Theorem 4.2 in m- So only the different part will be 
given in the following proof. 

Firstly, we have 


i=l j=l i=\ 


(79) 


(69) 


(70) 


Proof: From the definition of Frobenius norm, we know that 

\\E\\l=^\\ei\\l. (71) 


Then follow the same approach as da and utilize Markov’s 
Inequality. For any parameter /3 > 0 and A > 0, we have 

r r 

nilelli >=lP{exp(A||g||i) > exp(A/lETr(Si))} 

i=l i=l 

r 

= nE{exp(A||ei|| 2 ) > exp(A/3Tr(Si))} 


< 


n 


For any 1 < i < r, Bi ~ Af(0, Si), the eigenvalue decomposition 
of Si G R"'^” could be written as 


E{exp(A||ei||i)} 
exp(A,9Tr(Si)) 


E{exp(Aefj)}^ 


(72) 


where the diagonal matrix Ai := diag(Afi,-- - ,Ai„) and Af^ > 
Ai 2 > ■ ■ ■ > Af „ > 0 are eigenvalues of Ai. 

Let 


exp(A/3<72.) 

The moment generating function of the Gaussian random variable 


(73) 

(74) 


E{exp(Ae?j)} = 


1 




(80) 


^Chapter 4, (m 









let 


<^max •— (^min •— (81) 

we have 

r / , ^ 2 \\ T-nj'l 

P{||g||2 > /3ETr(S0} < ( "T-SSaT ) ’ 

(82) 

The rest of proof is the same as Theorem 4.2 in (m and will 
be described briefly. Replacing A with its optimal value such that 
the right side of l l82b is minimized, and regarding some formulas 
involving (Tmax and (Tmin for a constant C, we can derive the result 
of this lemma (which is also the result of Corollary 4.1 in fT^ under 
i.i.d hypothesis); 

P{||g||i > (1 + e) ^ Tr(E0} < exp(-^-^^), (83) 

i—1 ^ 

holds for any 0 < e < 1, where C > 0 is a constant depending on 
rTmax and CTmin. H 

Next, the lemma will be presented to estimate the influence of the 
error between sample eigenvectors and its true value on the 

volume correlation computation. Motivated by the relation between 
volume and determinant, the matrix perturbation theory was utilized 
to derive the result needed. 

Lemma 5: (Corollary 2.7 in (20)) For the matrix A G and 

the perturbation matrix E G we have 

• If A is full-rank, then 

n 

I det(A + E)- det(A)| < ^ s„-i(A)||f;||^, (84) 

i=l 

• If rank(A) = k for some 1 < k < n — 1, then 

k 

I det(A -b E)\ < \\E\\r'' Yl Sk-^{A)\\E\\i. (85) 

i=0 

here Sk (A) is defined as the fcth elementary symmetric function 
of singular values of matrix A G R"^": 

Sk(A) := Y aij^ ■ ■ ■ aif., 1 < k < n. ( 86 ) 

Now we will prove theorem [T] Assume the sample data be 

= ,ymj, (87) 

and its sample correlation matrix be then 

the volume correlation is of the form of: 

T(R("‘^) =vola,+k^([Qs,Q("‘^]), (88) 

where Qs is the matrix with columns being the orthonormal basis 
vectors of target subspace Hs and is the matrix with columns 

being the eigenvectors of corresponding to relatively large 

eigenvalues. Without lossing generality, assume 

= ,ykj, (89) 

We have 

T(R("‘^)=vol,,+k^([Qs,Q^Y) 

= det^/" (QsQs) det^/"(/„ - (Q^^^fQs (QsQs) QWY 

(90) 

where Pg is the orthogonal complement of the projection matrix 
onto the target subspace Hg. 


We noticed that approximate eigenvectors in J90b should be 

replaced by its true value to obtain the conclusion of theorem [T] It 
is natural for using lemma to estimate the error of approximation 
because the main object of calculation involved in i90l l is determinant. 
Firstly assume could be expressed as following linear random 

perturbation model, 

(91) 


where 


= [eY,- 

with eY and 

according to (|63, we have 




(92) 


are mutual independent. Then 


y,(m) _ ^ r 

* m L-^ (Ai — A, )2 

j=i 

3¥=i 

Therefore l l90b becomes 


QiqJ + 


n 


E 

j^km + l 




(93) 


T{r(Y = defi/^ 

= det^/^ ((P's + Ps P^"*’)^(Ps + Ps , 

(94) 


for simplicity, let 


V = P^qY, VK = P/p(’"\ 
then l l94b becomes 

y(jj(m))2 ^ det ((V +Fr)^(V +M^)) , (95) 

Let 

A = y^V, E ^v'^w+ w'^v+ w'^w, 

then we have 

= det(A + P), 


where 

A^iQYfPsQY, ( 96 ) 

E = PsQY + (E^^'>)'^Pg e‘-”^\ 

(97) 

From lemma [4] we noted that its two conclusions were distin¬ 
guished by the rank of matrix A. It indicated that the rank of A was 
a critical factor for accuracy of approximation. In fact, it determined 
the infinitesimal order for error of approximation. So the rank of A 
should be analyzed. 

According to l l96b . we have 

(QYfPsQY = Ikm - (QYfPsQY- ( 98 ) 

The rank of A is closely related to the rank of in other word, 

the structure of subspace spanned by received data Yi'^\ There are 
two possibilities for the structure of 

• If the target signal presents, then span(Q^^) n Hg ^ 0 and 

span(< 3 ^^) n He 7 ^ 0 ; 

• If the target signal doesn’t present, then span(<3^^)niTs = 0 
and span(Q^^) n He 7 ^ 0 ; 

In the first case, because span(<5^^) n Hg 7 ^ {0}, it is assumed 
that of km columns of qY were contributed by target subspace 
and the others came from clutter subspace. The corresponding result 
on the rank of (QYY'PsQY 6 e summarized in the following 
lemmas 






Lemma 6: Under the hypothesis of existence of target signal in 
sample data, we have 

rank((Q^"*>)^P^Q^”*^) = fc™-fe®, (99) 

where 1 < km < min(m, di + ^ 2 ), 1 < km < min(fcm, d 2 ),and all 
of its nonzero singular values (eigenvalues) are 1. 

Proof: It should he noted firstly that km can not excess di + d 2 
which is the intrinsic dimension of Hs © He, and km should not 
be larger than di no matter how large the number m of sample data 
is. Generically, we have km < fcm+i- Once km' = di +d 2 , we have 
k„ = di + d 2 when n > km' ■ 

It is natural to assume Qy = [Qs, G where 

Qs £ is matrix with columns bein^ parts of orthonormal ba¬ 
sis vectors for Hs, and Qs G jg matrix with columns 

composed of vectors in Ps{Hs © He)- and g is 

a orthogonal matrix. Hence from 1981 we have 

{Q^Y^fPsQ^Y^ = Ikm - {Q^Y^fPsQ^^^ 


Proof: Now we are ready to prove Theorem [T] According to 
I in Lemma [3 and (El, we have 


y(jj(m))2 =det(A + i;) 


<I|-E||2 


km km+k^ S,^_,S_,(A)||£||L 

i^O 

{A)\\Ef,^ + 0(11.^112^™+'), (110) 


where 

||-E'||2 = 

< 2||(Q^^)^P/P^’">||2 + ||Ps 

< 2||Q^'||2||Ps^P^"*’|| 2 + ||Ps 


= Ikm - 

= hm - (B^^Y 


Qs 


{Qsf 

Q^PsQs QsPsQ 


Ps[Qs,Qt]B 


±]n(m) 


= 2||Ps P("‘>||2 + ||P/P^"‘'||i 
< 2||Ps + ||Ps 


( 111 ) 


iQYPsQs {QYPsQi 


j(m) 


Then, according to Lemma [3] and Lemma H we have 


because 


we have 


-I ~Af(0,Ps S,?'"^(P/)'^), 

PsQs = Qs, PsQs = 0: (100) and for any e > 0 


( 112 ) 


QsPsQs = Ik^, 

QlPsQi = {QsfPsQs = {QsfPsQi = 0, 

therefore 


{qYYPsQY = Ikm - 




Ik^ 0 
0 0 

0 0 
0 Ikm-kS 


j(m) 


j(m) 


( 101 ) 


( 102 ) 


(103) 


Let 

_ 

then we have 


j{m) 

,(m) 


||p/pl"*)||| < (l + e)£Tr(P/Ei(P/)^), 

i=l 

holds with probability 

TO. ^ 1 r km-n-e^ 

P > 1 - exp{- - -}. 

Consider the right side of i ll 13L we have 

km km / H km •» •» 

Y:T,(Pt&Ps, = T.YT. a-Gtp 


(113) 


j(m) 


(104) 


{qYYPsQY = {bY^bY, 

assume the singular value decomposition of bY Oe 
p'”*) = C7[A,0]y, 


(105) 


i = l 


because 


i=l 


sY 




+ E 

i = fem + l 


Aid" 


(a2 - Ai)' 


■Tr(P/u,-rxJp/'^ 


)) 


Tr(Ps UjuJPs = Tr(iiJP 5 nj) < 1, 


(114) 


(106) we have 


where U G R('=™-'=m)x(fcm-fc® ) y g R'=-xt=m orthopnal 
matrices and A is diagonal matrix, because of the orthogonality of 

(107) 


y]Tr(Ps^S.Ps) < 


p(m)(p(m))T ^ UJ^2jjT ^ 


hence we have A = Ikm-kS and 

(p(m))Tp(m) ^ y 


\i\j 


^km-k^ 0 

0 0 


km km 

EE (A,-A,) 

1=1 j=i ' ' 1=1 

3¥'i 






(<t2 - XiY 


(115) 




(108) 


Combine ( fTTSt and i ll 13b . we have for any e > 0 and 0 < <5 < 1, if 


Then the following conclusion could be drawn: If target signal 
presents in sample data, then we have 




rank((Q^')^Ps = km - k^, 

and all of its non-zero singular values (eigenvalues) are 1. 

Proof of Theorem [l| 


(109) 


EE rx,.-YT2 +E(^-fe" 


Xia^ 


i=r j=i 


(Ai - A,) 


(<t2 - A.)2 


< ^^-^^' ,(116) 
1 + e 
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or equivalently, 

1 + £ 


m > 


(y5TT-l)2 

( 


fern fcm \ \ \ 2 

^ ^ (Ai - A,)2 + 

1=1 J = 1 ' ' 1=1 ' ' 


then 


\\PsE‘-’^^\\% < (VjTT- l)^ 

holds with prohahility 


(117) 


(118) 


' > 1 — exp{ — 


km-n-e 

C 


}• 


Then comhining dB with dl 18b . we get 

||i;||2 < 2(V5 + 1 - 1) + (V5 + 1- if = S-1<S, (119) 

thus we have 

T[R(”^'>f < _i^s + 0 ( 5 *™+^), 

( 120 ) 

holds with overwhelming probability. 

Furthermore, according to the definition of elementary symmetric 
function of singular values in d86b and d99b . it can be verified easily 
that 


hence we have 


If the number m of sample data is large sufficiently, then we have 

I s 


= 1, 

(121) 

‘ +0(5'“™+^), 

(122) 


therefore 


km — dl “t“ , km, — ^2 , 

y(jj(m))2 ^ gd2 + 


(123) 

(124) 


On the other hand, using d84b in lemma |5] we can similarly obtain 
the corresponding result for non-target scenario. 

I det(A + B) - det(A)| < s;.,„_i(A)||f ;||2 + 0(||F;||^), (125) 

When m is large sufficiently, we have km = di and = Qc- 

According to d90b . we have 

detHQf^Y= yolX+d2ilQs,Qc]) 

Similar to discussion above, for any 0 < S < 1 and e > 0, if 
1 + £ 


m > 


(v'JTT-iy 

( dl dl \ „2 , , 

2 = 1 j = l ' ' 2=1 ' 'I 


2=L J = 1 


then 


\\pfE^'^'>\\l < {i/sTT-if, 

hold with probability 


> 1 — exp{ — 


dl ■ n ■ 

C 


} 


(127) 


(128) 


therefore 


_ r\Hs, Hc)\ < Sdi-i{QlPfQc)S + o(<5"), 

(129) 
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